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Abstract. Fixed point iterations are known to generate chaos, for some values 
in their parameter range. It is an estabhshed fact that Turing Machines are 
fixed point iterations. However, as these Machines operate in integer space, 
the standard notions of a chaotic system is not readily applicable for them. 
Changing the state space of Turing Machines from integer to rational space, the 
condition for chaotic dynamics can be suitably established, as presented in the 
current paper. Further it is deduced that, given random input, computation 
performed by a Universal Turing Machine would be ^almost surely' chaotic. 



According to the Church- Turing thesis, there is an abstract comput- 
ing device, called Universal Turing Machine(UTM), which can simulate 
any effective computation. It well known that a UTM is a type of iter- 
ative map or dynamical system. Dynamic Systems can exhibit chaotic 
behavior, and in this paper condition of chaos in a UTM has been de- 
rived. It has been formally proven that, given a random computation 
to simulate on any UTM, the resulting dynamics will be 'almost surely' 
(with probability 1) chaotic. 

1. Introduction 

Alan Turing's abstract computing device, Turing Machine, is in effect a fixed 
point iteration. While computation is believed to be of predictable nature, generic 
fixed point iterations do not always behave similarly. Some iterations do indeed 
converge to a fixed point, while others diverge to infinity, yet a few oscillate between 
multiple points. But in some cases, the iterations neither converge, nor diverge, 
and do not even oscillate between points. This deterministically random behavior 
(Section [2]) is informally termed as chaos. 
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Many theoreticians believe absence of direct connection between computation 
and chaos. However, many dynamic systems which are capable of simulating 
Universal Computation for some configurations, also exhibits chaotic behavior for 
some other configuration |16] . Most prominent example of this behavior is cellular 
automata [16]. 

In Section ([3]) a brief discussion establishes Turing Machines as fixed point iter- 
ations, and lays the foundation of our work. Section demonstrates the concept 
of Chaos in the Universal Turing Machines. Further, it follows (Section [5]) that 
Universal Computation, 'almost surely' , will he chaotic. 

As a closure (Section E]) three philosophical implications of this finding are dis- 
cussed. 

2. Fixed Point Iteration and Chaos 

Let us define a function, / : X — )■ X , where its domain and range are the same. 
Starting with an initial input 'xq' to the function, let '/' produce the output 'xi', 
such that, 

xi = fixo). 

The output of the function in each step can be taken as the input to next step 
and hence, 

Xn = f{Xn-l) ; n > 1 (2.1) 

Equation 02. ip is called a fixed point iteration, and is the simplest example of a 
dynamic system 

Iteration (12. ip may result in Xn being converged to a limit point, diverge to 
infinity, or even exhibit chaos. Banach Fixed Point theorem lA.H presented in 
Appendix-E| details the condition under which the iteration would converge. 

For a fixed point iteration of type (12.10 , defined over some interval /:/—)■/, 
the inverse of the Banach theorem lA.ll has an interesting implication. Iterates will 
not converge to one point if at least one of the following conditions is true. 

(1) Function '/' has no solution in / : ^x* G / s.t. f{x*) = x* . 

(2) The interval / is not complete . 

(3) is not a contraction mapping satisfying ( lA.ip . 



The iterate (12. ip . then, will not converge, and will either diverge or may even 
run in a chaotic manner. 

However, Chaos is a tricky term to define. It is much easier to list down proper- 
ties that a chaotic system must possess, than giving a precise definition of chaos. 
Appendix [A] has the related definitions. 

Definition 2.1. Characteristics of Chaotic Dynamics. 

(1) Sensitivity to the initial condition of the system (where the neighborhood of 
the initial point can quickly lead the system into very different final states.). 

(2) Having a dense (definition \A.9\) collection of points with periodic orbit (def- 



inition 1X7^ ). 
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(3) Topologically mixing (definition \A.10^) . 

(3) and (2) in the definition (12. ip imply the sensitive dependence on initial 
conditions (1). 

Some dynamical systems, like the one- dimensional logistic map (equation (12. 3p ) 
with r = 4.0, are chaotic everywhere. However in many cases chaotic behavior is 
found only within a subset of phase space. 

We present three functions exhibiting chaotic dynamics in the following subsec- 
tion. 

2.1. Chaos in the Fixed Point Iterations. 

2.1.1. The Chaotic Search for i = The Babylonian method employs an 
iterative map to calculate \/S with arbitrary precision. 

1 S 

Xn+i = -{xn + —);n>0;XneR (2.2) 

Z Xn 

When < , there is no G M that would satisfy x*"^ = S. The iteration can 
clearly not converge. It is discussed in [5j that the iteration eventually becomes 
chaotic. 

2.1.2. Chaos in the Logistic Map. Logistic map is a very well known example of 
an iterative map demonstrating chaotic dynamics :- 

x„+i = rx„(l - x„) ; r G [0,4] ; X G (0, 1) (2.3) 

When r = 3 , the system bifurcates into two fixed values. If r is increased further, 
the bifurcations continue further, till reaching a chaotic state at r = 4. There are 
numerous discussions in [T7] of this behavior. 

2.1.3. Chaos in Aperiodic, Bound, Non Convergent Fixed Point Iterations. We 
show here that such a function in fact exhibits chaotic dynamics, much like the 
Babylonian method to search for Demonstrating chaos in an iteration would 
require one to show that the system is topologically mixing, having a dense orbit. 

Lemma 12.11 proves the topologically mixing property of the system. The con- 
struction used in the lemma can be used to show that the orbit is dense. 

Lemma 2.1. Topological mixing in an Aperiodic, Bounded, Non-Cauchy 
sequence. 

Let M =< Xn > be a sequence with x„ = /(x„_i) with f : X ^ X such that:- 

(1) Vn ; I < Xn < u , where —oo<l<u<oo are its hounds ; 

(2) Xi = Xj =^ i = j ; that is the sequence has a period of infinity ; 

(3) < Xn > is not a Cauchy Sequence ; 

Then, there exists a set \E' C X where 'f ' is topologically mixing. 
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Proof of the lemma lKTi Using lemma IA.2I (in Appendix [A]) we assert that, for 
JV =< Xn > , there exists a countable set, C, of Cauchy sequences, which are 
subsequences of JV. We assume \C\ = n. 

Clearly every Cauchy sequence in C, Ci , has a limit point Oj. If ^ X , X 
must be an incomplete metric space (for example X C Q ). We can use arbitrarily 
close approximation of by virtue of density of rational numbers (Q) in real (M) 
set ( by definition IA.9I ) and name it ai. If X is a complete metric space, we define 

Clj Qjq^ . 

The set of approximate limit points, so constructed, can be defined as follows. 

^ = {a; I ai = lim{Ci)} 

where each aj is distinct. 

Cauchy subsequence, Ci , whose limit point is ai eventually will be entering the 
interval /j = |x — ai| < e, where e is an arbitrarily small number. Hence, every 
single point be included in the Cauchy subsequence Cj. 

Now, as Ci is a Cauchy sequence, 3Ni such that, 

Wi : n,m>Ni =^ \xm — Xn\ < e; 

However, these sequences are 'mixed' or 'interleaved' with one another. Assume 
that fcj'th element of the sequence JV is identified with the Xj'th element of the 
sequence, Ci. 

We can now generate the set K = {ki, ^2, fca, kn}- 

Obviously, there is a Km = MAX{K), where MAX is the maximum value of 
a set. Km would vary depending upon the choice of 'e'. Nevertheless, there will 
always exist one Km for any choice of 'e'. 

Based on a unique choice of e , hence Km, we have, 

Vi : Xn, a^m ^ j n,m > Km — ^ |a^m Xn\ < ^ j 

Now, we construct the sequence A/"* = {xj G A/" ; i > Km }■ 
Let the set of ordered limit points { i < j =^ ai < aj ) be, 

A* = {ai, a2, as, an} 

We create a set of 'ra' sequences, A = {^*,^2)'^3) •■■j-^n}' follows:- 

(1) Take x* G M* , and remove it from Af*. 

(2) Find all di{x*) = \x* — ai|. Let the minimum be achieved at i = k i.e 
dk{x*) = MIN{di{x*)), where MIN stands for the minimum value of a set. 

(3) Assign X* to the A;'th newly created sequence : Al- In case there are two 
sequences {l,k, I < k) satisfying M/A^((ij(x*), assign x* to the one having 
lower index i.e. : A*i. 

(4) Repeat the above steps (1,2,3) till M* is exhausted. 

By virtue of construction, we achieved 'n' centroid clustering of the sequence 
M* . The same centroids are also present in M. 

Every sequence A*, has k = MIN{A*) and Ui = MAX (A*). 
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Using li and , center ai to create intervals as follows:- 

I^=[li,U^] (2.4) 

We now show that the neighboring intervals Xj and Xj+i are disjoint. 
The construction of the intervals implies the following. 

X ^ .2^2 I X cij I ^ I X cij I 
tij - ai < ai+i - Ui =^ Ui < ^(ai + ai+i) (2.5) 

^i+i — k+i < k+i — ai =^ > -(ai + ai+i) (2-6) 

Combining relations (12. 5p and (12. 6p we find, Ui < /j+i. However, as asserted, no 
elements repeat in A/" and subsequently in A^*. Hence Ui ^ , and ttj < /j+i. 
Therefore, all the Xj G {X^} are disjoint, 

iy^j =^ Xi nXj = (2.7) 

Now, we construct a set \E' as the union of all the 'n' disjoint intervals corresponding 
to 'n' sequences, A = {A*}. 



n 



^ = |JX, (2.8) 

i=l 

Using equation (12. 7p and (12. 8p . we can say that the set, X = {X/j}, defines a 
basis for topology (definition IA.8I ) in \E'. 

As the function is aperiodic, it is clear that the function '/' mixes all the Xj 
intervals, so that 3n > such that, 

V2,j r(X,)nX, ^0 (2.9) 

Now, consider two non empty open sets : A, S C vj/ ^ such that APi B = 0. 
That would mean, 3Xfc ,Xr such that, 

BnZky^ ] Anik = 

and 

5 n X, = ; AnZr^0 
Using equation (12. 9p . we can say, 

r(x,)nXfc^0 

This will readily imply, 

r{A) r\B^0 

which proves the assertion about mixing. □ 

Theorem 2.1. Any iterative map whose orbit is Bound, Aperiodic and 
Non-Cauchy, is chaotic. 

Let f : X X he a map where X is an interval. Let the orbit, O, of f be 
bound, aperiodic and Non-Cauchy. Then, f is chaotic. 
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Proof of the theorem \2.1\ . Invoking Lemma 12.11 we can say that the function is 
topologically mixing within \1/ C X. \1/ is defined as in f|2.8p . 

We now show that the orbit of the function / is dense in \[^. To do that, we 
take any arbitrary point x G ^E'. By definition, \I/ is an union of non overlapping 
intervals Xj (12. 4p . Clearly then, 

V « 7^ j ; X EXi =^ X ^ Xj 

By virtue of construction (Lemma 12. ip of Xj :- 

Xi,X2 G Xj =^ \xi — X2\ < e. 

This construction would imply that 3a;* G Xj such that:- 

|x — x*| < e 

By definition of dense set (definition IA.9I) then, the orbit (9 of / is dense within 

It is evident that '/' exhibits both dense orbit, and is topological mixing. By 
definition 12.11 of Chaotic system, then, / is chaotic. However, its orbit O is not 
periodic. □ 

Computer simulation of the following iterative function, 

Xn = tan{xn-i) (2-10) 

displays above behavior as experiments have shown. It is bound, and its singular 
points are irrational in nature, which can not be approximated well enough in a 
digital computer (except x = ). The aperiodic nature is conjectured, as even 
after a billion iterations there was no repeat experimentally recorded. All the 
three distinct features of chaos can be seen as this function iterates. 

3. Computation and Iterative Maps 

A Turing Machine (Appendix [B]), after performing a computation typically 
writes back to the tape. Every write in the tape can be considered a step, and the 
state of the tape (the ordered sequence of symbol on the tape) can be denoted by 
T. By definition, thus, at every step, a state transition takes place based on the 
tape sequence. 

Let the state of the tape at step 'ri,' be designated as 'T„'. At the 'r2, + I'th step, 
the state of the tape changes to T„+i'. 

Hence, a specific Turing Machine, 'M', while computing, changes the state of 
the tape from T„_i — )■ T„ , and that is the computation that took place in the n'th 
step. 

Hence, a computation can be written as:- 

T„ = M(r„_i) (3.1) 

Comparing equation (12. ip with equation (13. ip . we can suggest that Turing Ma- 
chines are a very specific type of iterated maps. This has been discussed in [IB] , 
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and a formal proof based on neural network is established in [12j. The following 
theorem is due to Hyotyniemi. 

Theorem 3.1. (Hyotyniemi) 

Turing Machines can he written as non linear discrete time system, of the form:- 

Xk+i = g{xk) ] k>o 

This theorem obviously leads to the following lemma : 

Lemma 3.1. Universal Turing Machines are Iterative Maps. 

Proof of the Lemma \3. il Every Turing Machine is an iterative map. A Universal 
Turing machine, being a Turing machine is one. □ 

3.1. Phase Space of Turing Machine. According to equation fl3.ip , the tape 
of the Turing Machine is the iterate variable, with symbols from F. But to be able 
to represent the phase(iterate variable) of a Turing machine, the tape needs to be 
converted into a number. Hence, we need to discover a way to map every Tn into 
a number. This conversion is established using Gddelization techniques. 

Definition 3.1. Gddelization (Gddel). 

Any string from an alphabet set T can he represented as an integer in hase 'b' 
with b = |r|. To achieve this, create a one-one and onto Gddel map g : ^ Db , 
where, 

A = {0,1, 2, ...,6-1}. 

Gddelization or G : S+ — t- Z+ then, is defined as follows: 

A string of the form w = Wn-iWn-2---WiWo , with Wi , can he mapped to an 
integer 1^ = G{w) as follows: 

n-l 

I^ = Giw) = Y,9iwk)b'' 

The common decimal system is a typical example of Godelization of symbols 
from {0, 1, .., 8, 9}. The binary system represents Godelization of symbols from 
{0, 1}. As a far fetched example, any string from the whole english alphabet, can 
be written as a base 26 integers! 

Definition 3.2. Rationalization. 

Any string 'w' of length 'n' (\w\ = n) , created from an alphabet set T, can 
be represented as a rational number x E Q. We define the rationalization, p , in 
terms of Gddelization ( definition \3. 1\) as follows : 

X = p{w) = —— = = 0.Wn-lWn-2---W0 

By definition, x E [0, 1]. 
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The definition fl3.2p of rationalization effectively means that we are implicitly 
adding a decimal point to the left of the representation to convert it to a floating 
point (rational) number in base 'b'. 

Based upon the definitions of (13. ip and (13. 2p . we now define the phase of a 
Turing Machine as follows : 

Definition 3.3. Phase of a Turing Machine. 

The phase of a Turing Machine having right infinite tape, at step 'n' with respect 
to a particular Gddel map 'g', is the rationalization (definition \3.^ ) of it's tape 
state, Tn i.e. , 

By definition, then, the phase is always hound inside the interval [0, 1] . 

Definition 3.4. Phase Space of a Turing Machine. 

From the earlier definition of phase, the phase space of a Turing Machine is a 
metric space, defined as below : 

Xt = {x\x G Q ; < X < 1} 

The associated distance function, 'd{x, y)' , of the above phase space is defined as 
d{x,y) = \x — y\.It induces an incomplete metric space. 

Definition 3.5. Turing Machine is a discrete time Dynamic System. 

A Turing Machine can be thought of as an iterative map, M : Xt ^ Xf . 

4. Chaos And The Turing Machines 

Undecidable problems in Computation are more interesting from the point of 
view of chaos, as a decider would surely halt. The end result would not be chaotic. 

The famous "Halting Problem" (Appendix [B]) is an example of one such unde- 
cidable problem which was the first of such to be discovered. 

4.1. Implication of the Halting Theorem. We define a debugger machine 
which contains a Universal Turing Machine (definition [R4]) as "sub"-machine within. 
It is capable of executing the instructions of the "simulated" Turing Machine one 
instruction at a time. This machine is also capable of reading the whole tape of 
the Universal Turing Machine (till the blank symbol), rationalize (definition 13. 2p 
it, and put that number into a table. 

By the definition of rationalization (definition 13. 2p . it just involves copying the 
current content of the tape, end to end. For one sided tape bounded at left, it 
copies from the start of the tape to the first blank symbol. The real number 
is implicit from the content of the tape. The decimal point is assumed to be in the 
left side of the tape. 

Definition 4.1. Debugger Machine. 

A "Debugger Machine" is a 3-tuple of Am = {H table ■, ^utmi S) where. 
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(1) 'Mutm' i'S a Universal Turing Machine. It will be simulating a Turing Ma- 
chine with some input, where output of the simulated turing machine at 
any step is with the left bounded, right infinite tape T 

(2) 'Htabie' is a tabular sequence < hn > where rationalized 'T' is stored at 
every step. 

H =<hn> ; hn = p{Tn). 

Clearly, H C Xt . 

(3) S = {Y, N} is a set of Symbols to be output for a simulation. It will output 
"Y" where system halts, else it will output "N". 

The machine would operate as follows :- 

(1) The tape 'T' of Mutm is populated first with the description of a Turing 
Machine 'M', and the input to simulate 'M' with. 

(2) The machine Aj\,f would next rationalize the tape 'T' , and store it in table 
H={p{T)}. 

(3) The machine A^,/ now, will use Mutm to execute '5' transitions unless a 
transition leads to writing to the tape. 

(4) Now, Ajv^ will rationalize the current tape, p(T„) , and search it from 

(5) If p{Tn) G i/, the machine 'M' encountered a loop. Halt execution, and 
report a loop by outputting "N" . 

(6) If p{Tn) ^ H , then, insert p(T„) in 'if', and resume from (3). 

(7) If the Mutm has halted, output "Y", for success. 

One example of such computing would be computing the square root with Babylo- 
nian method using equation (12. 2p . The Tape 'T' then would have the That 
is, the tape is the scratch pad for the Universal Turing Machine, it won't contain 
the encoding of the embedded Turing Machine. 

4.2. Debugger and the Halting Problem. The debugger machine (definition 
14. ip is nothing but a would be decider Turing Machine. By the Halting Theorem 
(definition IB. ID Aj\/ is guaranteed not to halt at every input for the tape T'. 

In case A^f does not halt, the generated sequence 'i/' needs to be carefully 
analyzed. 

Clearly, H at this stage {/iq, /ii, /i2, hn} is a bounded sequence, as 

< /in < 1 ; n G [0,oo) 

At this stage there are only three possibilities with the sequence H =< hn > '■- 

(1) terminates finally and becomes a finite sequence, due to 

(a) either debugger machine halting, 

(b) or nothing further is written on the tape, ever after. 

(2) becomes infinite Cauchy Sequence. 

(3) becomes infinite but remains a Non-Cauchy sequence. 

This 3rd behavior establishes the next theorem:- 
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Theorem 4.1. Chaos in Computation (Debugger Machine) . 

A non halting Am , exhibits chaotic dynamics when < hn > is non terminating, 
and is not a Cauchy Sequence. 

Proof of the Theorem \4-l\ We note that the Turing machine Am is a map:- 

and under the condition of the theorem the orbit < /i„ > is aperiodic, bound and 
Non-Cauchy. Furthermore, Xt = [0, 1] is an interval. Therefore, we can invoke 
theorem 12.11 to conclude that:- 

Am would exhibit chaotic dynamics. 

□ 

It is known that Universal Computation can be Devaney-chaotic yjj. This 
theorem reiterates the same from the perspective of Turing Machine. 

The theorem 14. II has significant practical consequences. By tracing the sequence 
< hn > for a long time, we can check if it indeed has been behaving like a con- 
vergent (Cauchy) sequence. If that is so, we can heuristically take a decision on 
halt the machine, assuming that the system would reach the limit point of the 
underlying Cauchy sequence. 

However, clearly that is a probabilistic approximation. There is no guarantee 
that the sequence will remain Cauchy after the observation. 

Theorem 4.2. Equivalent Classes on the Sequence < hn >. 

The < hn > sequences can be categorized into three mutually exclusive set of 
sequences. 

(1) Finite Sequences T : These halt the debugger machine or do not write onto 
the tape anything after finite time. 

(2) Infinite Cauchy Sequences, C, each converging towards their limit points in 
R. 

(3) Infinite Non-Cauchy sequences , Af , which are chaotic sequences. 

Proof of theorem \4.2\ The proof is trivial using theorem 14.11 □ 

5. Probability, Chaos and Turing Maghines 

In this section we discuss the implication of the theorem 14 . 1 1 using very standard 
notion of 'a.s' within the realm of probability theory. 

Appendix O has all the definitions needed to define the axioms of probability 
theory. 

Lemma 5.1. Cardinality of the sets T , C and J\f. 

On the interval of rational numbers, Q = [0,1] , let :- 

(1) S be the set of all sequences, finite or infinite, such that Xi = Xj =^ i = 3 
(no elements repeat ) ; 

(2) d S be the set of all finite sequences (without repetition) ; 
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(3) C G S be the set of all Cauchy Sequences (infinite, without repetition) ; 

(4) -^f G S be the set of all Non- Cauchy Sequences (infinite, without repetition). 

Then T is null set with respect to S, with \S\ = \C\ = \J\f\- 

Proof of lemma \57T\ We note that, 

S = FVJCVJU (5.1) 

The set S is created by drawing one rational number at a time, without replace- 
ment from the set of rational numbers. Clearly then, we have, 

\S\ = IQI X |Q-1| X |Q-2| X ... 

As |Q| = Kq, the above can be written as, 

|5| = Ko(Ko-l)(Ko-2)... 

Using aleph arithmetic, we get, 

\S\ = KoKoKo... = K^" 

Since 

K^o = 

we must have:- 

\S\ = (5.2) 

The set J-" can be mapped one to one with the Natural Numbers Set. It can be 
thought of as the set of numbers having finite decimal representation in base '6' by 
virtue of the rationalization (13. 2p . Then it is pretty obvious to arrive at equation 

1^1 = Ho (5.3) 

Using equations ( 15. 2 p and (15. 3 P we can conclude that J-" is null set w.r.t. S. 
It is well known that the cardinality of the set C is the cardinality of continuum, 
that is:- 

\C\ = Ki (5.4) 

Using lemma ( ]A.2p we can say that every single Non-Cauchy Bounded Sequence 
comprises countable Cauchy sequences. 

Consider the general class of Non Cauchy sequences, M{k), which are mixture 
of 'fc' distinct Cauchy sequences. Clearly, 

C=M{1). 

Then clearly:- 

Wm = (''■^^^^'V/(fc) = ('^')M(fc) = = ^i^W (5-5) 
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where M{k) is the all possible way 'k' chosen infinite ( ) sequences can be 
mixed. However :- 

Ho 

X=\Jj^^k) (5.6) 

k=2 

with disjoint J\f{k) :- 

Using the cardinality from equation (15. 5p and (15. 6p with the disjoint property we 
do get:- 

Ho No 

w\ = '£wm = Y.'^iM{k) 

k=2 k=2 

Using continuum hypothesis (definition IC. 41 ) the bounds for M{k) becomes: 

Ko < M{k) < Ki 

then, this would imply : 

^o^i^o < lA^I < Ko^i^i 
which gets back immediately to:- 

lA^I = (5.7) 
Hence, using equations (15 ■4p and (15. 7p we get:- 

\C\ = \Af\ = Ki 

which proves the theorem. □ 

That the set S has cardinality equal to the its own subsets, i.e. C and Af should 
not surprise us. An example of this is the set of integers, which has the same 
cardinality Kg as that of the individual sets of even and odd integers. 

The set. A/", is in some sense bigger than C even though they have same cardi- 
nality. Other examples exhibiting similar behavior include the set of continuous 
intervals within a line. The intervals have same the cardinality (Ki) as that of R". 
In effect the cardinality of the points in a line is the same as countably infinite 
dimensional Euclidean space. 

This brings us the question. What would be the chance that a randomly chosen 
sequence s E S would belong to C or A/" ? After all, the chance that a number, 
selected from a real ray [0, oo) would belong to any bounded line segment [a, b] , 
is zero. The idea is formalized next. 

Definition 5.1. Sequence to Space Mapping Function. 

Let us define S{n) to be, 



S{n) = [jAf{k) {5.1 



k=l 
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and let:- 

R" = h] ; n e [0,1] ; < k < n ; i< J ^ n < rj (5.9) 
Then, the function :- 

fs : S{n) ^ R" (5.10) 

will be called Generalized Sequence to Space Mapping Function. The co-ordinates 
are found as follows. 

Let lim{Ck) denotes the limit of the Cauchy sequence C^. Let s G S{n) he a 
mixture of Cauchy sequences \{Ck}\ = m. Then, 

\lim{Ck) ifO<k<m 
ffc = < (5.11) 
I lim{Cm) if m < k <n 

In particular, if the sequence s G S{n) is also in C, with lim{C) = r then 's' will 
be mapped to rc where, 

rc = N ; Vz r, = r ; r G [0, 1] (5.12) 

We note that the curious r G [0,1] is the outcome of rationalization of the 
sequences as definition (13. 2 p with all the Cauchy sequences having individual limit 
points, c G [0,1]. If we have a sequence 'S2' of two Cauchy sequences mixed, 
individually converging to points {a, b}, with a < b, we would denote the sequence 
as S2 = (a, b). This ensures that the permutations of components does not matter. 

Definition 5.2. Sequence Space. 

Every sequence s G S{n) can be mapped into R" (equation \5.9\} using the se- 
quence space function fs ( equation \ 5.1U\) of definition (15. ip . The space so con- 
structed is to be defined as sequence space. 

Definition 5.3. Sequence Measure. 

Let A C S{n) . The sequence measure of A is defined as, 

fi{A) = n\V,{A) 
where Vn{A) is the volume occupied by the set A G M". 

It is easy to check that the sequence measure (definition 15. 3p follows all the 
properties required for a measure (definition IC.2p . Moreover, '/u' is a probability 
measure, because, 

n{S{n)) = n\V{S{n)) = 1 ; Vn > 0. 

Theorem 5.1. Measure of Cauchy Sequences is over Sequence Space. 

Let C C S{n). Then, 

Vn{C) = ; n > 1, 

and subsequently fi{C) = 0, under the tenets of sequence measure (definition \5.3\} 
over sequence space (definition \5.2\} whenever n > 1. 
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Proof of theorem I5.il We note that under the effect of fs (equation IS.lOp of defi- 
nition flS.ip the Cauchy sequences are given by the equation fl5.12p . 

rc = [ri] ; Wi ri = r ; r e [0,1] 

represents a diagonal inside an 'n'-dimensional hypercube for n > 1. For n = 1 it 
is M. Then, for the case of n = 1, the volume measure gets replaced with length 
measure and we get :- 

Vi{C) = A(C) = 1 when C C 5(1) 

For n = 2, it is the diagonal of a unit square, where the volume measure has to be 
replaced by the area measure, and we get :- 

V2iC) = A{C) = when C C 5(2) 

For > 3 it is the diagonal inside a unit hypercube, and hence:- 

K(C) = when C C S{n) ; n > 3 

Hence, the set C C S{n) with n > 1 is a null set with sequence measure (definition 
15. 3p (volume measure) 0. □ 

With this discussion in mind, we formally present the theorem that states: 
Universal Computation, as debugger machine Am runs it, is 'almost surely' 
Chaotic. 

Theorem 5.2. Universal Computation performed by Debugger Machine 
is 'Almost Surely' Chaotic. 

Let the machine Am be given the task to simulate a random Turing Machine, 
Tm , on a random input. Then, the sequence H =< hn > of Am will be chaotic 
with probability '1 '. 

Proof of theorem 15. Under the tenet of , 5 is the sample space. Hence, 

P{H eS) = l 
We note from equation 05. ip that :- 

P{H eS) = P{H + P{H eC) + P{H g A^) (5.13) 

The probability that H will have a finite length is given by Lemma 15.11 

P{Finite) = P{H eJ^) = 0. (5.14) 

Hence, the Am will almost surely not be of finite length. This implies that the 
debugger machine 'almost surely' won't halt. Hence, 

P{H G CUA/") = 1. 

We note here that the sequence measure (definition 15. 3p is a probability measure. 
That is because in the general case, the measure adds up to the volume of a unit 
hypercube, as defined. In this regard, theorem (15. ip implies:- 

> 1 ; Vn{C C S{n)) =0 =^ P{seC) = 
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for any S{n) with n > 1. However, we note that:- 

S = lim S{n). 

Applying Lemma (15. ip and theorem (15.11) . for H =< hn >, we get, 

P{H e C) = P{H e = (5.15) 

Clearly then using equations (I5.13P and ( I5.14p and f l5.15p , the probability that 
the machine Am does eventually end up in a Non-Cauchy sequence is, 

P{HeM) = l (5.16) 

However, using theorem (12. ip we can say, when H & Af , that is the condition of 
chaos. Hence, 

P(< K > is Chaotic) = P{H e M) = I; 
which establishes the theorem. □ 

6. IMPLICATION 

Theorem (15. 2p actually means that:- 
A randomly drawn discrete state Universal Computing Mechanism, given a 
random input, would behave chaotically 'almost surely'. 

The notion "chaotically" is being defined as a condition when sequences of the 
tape symbols comprise multiple Cauchy sequences. Both the terms chaotic and 
multiple Cauchy have deep implications, which is discussed further in the following 
subsections. 

6.1. Halting Problem : A New Perspective. We start with the celebrated 
halting problem (1B.6P itself. Yes, it is a fact that we can 'almost surety' conclude 



that Turing Machines would not halt. We know now that this is due to the presence 
of chaotic sequences! We will demonstrate that in the below paragraph. 

Let's assume for the moment that somehow in some universe the sequences 
'almost surely' do not get chaotic. In that case, 

P{H) = P{< hn > is Convergent ) = 1 

which would mean, the only problem is the precision of the result, not the result 
itself! One can halt a computation just by being imprecise. The situation is 
not much different as of today. Any attempt to find an irrational number with 
arbitrary precession would not ever halt. But then, with the choice of precision, 
we can halt it any time, with probability 1. 

However, due to the chaotic component (imprecise term) we can not do it for 
any input, or any sequence. Thus, the crux of the halting problem stems from 
the undecidability of 'the fixed points of convergences '. This in turn represents the 
chaotic part of computation. 
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6.2. Simulating Neurons : Brain and Mind. From time immemorial, the 
debate has always been around a mechanistic view of "mind" and the workings of 
the brain. The mechanistic view applies Church Turing Thesis, upon brain, and 

treats it as just another computing device. This makes mind just as powerful as a 
Turing Machine. This view expresses the possibility of Turing Machines simulating 
mind! 

However, as we already established. Universal Computation can be chaotic. 
So even if brain works as a "Universal Computer", it might not be enough to 
predictably simulate it. The problems are two fold. 

(1) The problem lies in applying the model of Turing machine. There is no 
provision to change data on the tape non deterministically in a Turing 
Machine. Real world behaves differently, where data changes non deter- 
ministically. Thought process, no matter how deep, eventually halts. Our 
perception changes based on data gathered by our senses. Mind is clearly 
not a "constant program, constant data" computation. 

(2) Another problem is due to the intrinsic property of any chaotic system, 
i.e. sensitive dependence on initial conditions. How can one set the initial 
conditions? With what precision ? Rational numbers do not have finite 
precision. Clearly, even if we solve the problem of finding all the input 
parameters, a very small loss of precision would guarantee a totally different 
behavior of the simulated system under Turing Machine model. 

6.3. Learning Processes. We reflect about the learning process, where patterns 
are present within large set of data set X, combined from different sources. The 
learning process is all about discovering a function, : X — > X such that JC C X 
can be recreated by using the fixed point iteration map Ck ■ If that is indeed the 
case, we may say that the system has learnt Ck ■ 

But then there might be a set of such functions C = {Ci} operating on the 
domain X. Generalized learning is all about discovering such Cj. and adding them 
to the set of known C 

A Randomly Drawn Turing Machine might be capable of producing the set /C , 
alongside other sets. From the biological learning perspective, this has enormous 
value. Such a system is capable of detecting if the current state was encountered 
earlier (e.g The debugger machine Am)- Further if it can somehow also detect the 
countable Cauchy subsequences Ci, it can then "memorize" the process through 
which the sequences could be regenerated. This would be a "learning" for the 
system in true sense. 

It can go further. The system which wants to concentrate on the subsequence 
Cr to arrive at the solution ar , i.e. CrS limit point, can heuristically "guess" 
and approximate the solution. This can happen even without solving the Turing 
Halting Problem as well. 
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The clustering of points in Xt or the H space, can be taken as the signal that 
the Turing machine won't ever halt with the input. For more pathological cases, 
where the H space would spread the whole of Xt , more sophisticated guesses 
should be needed to heuristically stop and learn. 

7. Summary and Future Works 

In this paper, we have demonstrated that the Universal computation performed 
by the debugger machine is 'almost surely' chaotic. The debugger machine is just 
a trick to formally showcase the inner chaotic nature of the universal computation. 
The implication of this statement has far reaching consequences in the philosophy 
of learning as discussed in Section [6l 

Historically, computation is thought to be all about finding directed answers by 
deterministic, mechanical, step by step procedures. In nature computation arose 
automatically, by the natural force of evolution, over billions of years, without 
any external programming. The solution to this puzzle lies in the idea that any 
mechanism capable of universal computation can be chaotic. 

Chaotic, non halting computation might generate very good approximate solu- 
tions to multiple, unrelated problems. This is under the assumption that a com- 
putation can be halted at will. In nature, this happens frequently where a living 
organism simply dies! But those who survive, takes the learning forward. 

This paper is a small step towards the journey of solving the mystery of nat- 
ural computation. Further research would ensure the continuing quest for the 
most astonishing journey that the humans have ever embarked upon. This is the 
journey to understand ourselves, who, in the immortal words of Carl Sagan are : 
"unconscious matter grew into consciousness". 

Appendix A. Definitions of Section [5] 

Definition A.l. Fixed Point of a function. 

For a function f : X ^ X , x* is said to be a fixed point, iff f{x*) = x* . 

Definition A. 2. Metric Space. 

A metric space is an ordered pair (M, d) where M is a set and d is a metric on 
M , i.e., a f unction :- 

d: M X M 

such that for x,y, z E M , the following holds:- 

(1) d{x,y)>Q 

(2) d{x,y) = iffx = y. 

(3) d{x,y) = d{y,x) 

(4) d{x, z) < d{x, y) + d{y, z) 

The function 'd' is also called "distance function" or simply "distance". 
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Definition A. 3. Cauchy Sequence in a Metric Space (M, d) . 

Given a Metric space (M, d) , the sequence Xi, X2, X3, ... of real numbers is called 
'Cauchy Sequence ', if for every positive real number e , there is a positive integer 
N such that for all natural numbers m,n > N the following holds:- 

Roughly speaking, the terms of the sequence are getting closer and closer to- 
gether in a way that suggests that the sequence ought to have a limit x* G M . 
Nonetheless, such a limit does not always exist within M . 

Note that by the term: sequence we are implicitly assuming infinite sequence , 
unless otherwise specified. We will explicitly use finite sequences in Section HI 

Definition A. 4. Complete Metric Space. 

A metric space {M,d) is called complete (or Cauchy) iff every Cauchy sequence 
(definition ] A. !^} of points in {M,d) has a limit , that is also in M . 

As an example of not-complete metric space take Q , the set of rational numbers. 
Consider for instance the sequence defined by Xi = 1 and function d is defined by 
standard difference between d{x, y) = \x — y\ , then :- 

1 / 2 



This is a Cauchy sequence of rational numbers, but it does not converge towards 
any rational limit, but to \/2 , but then \/2 ^ Q . 
The closed interval [0, 1] is a Complete Metric space. 

Theorem A.l. Banach Fixed Point Theorem [Ij . 

Let (X, d) be a non-empty complete metric space (definition A. 4 ) ■ Let / : X — )■ 
X be a contraction mapping on X , i.e.: there is a nonnegative real number q <\ 
such that:- 

d{f{x)J{y))<q.d{x,y); \fx,y e X (A.l) 

Then the map f admits one and only one fixed point ( definition \A.1\ ) x* in X. 
Furthermore, this fixed point can be found as follows:- 

Start with an arbitrary element € A and define an iterative sequence by 
Xn = f{xn-i) for n = 1,2,3, ... . This sequence converges, and its limit is x* . 

Definition A. 5. Orbit. 

Let / : A — > A be a function. The sequence O = {xq, Xi,X2,X3, ...} where 

Xn+i = f{xn) ; XneX ; n>0 

is called an orbit (more precisely 'forward orbit') of f . 

f is said to have a 'closed' or 'periodic' orbit O if \0\ ^ 00 . 
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Definition A. 6. Topological Space. 

Let the empty set be written as : 0. Let 2^ denotes the power set, i.e. the set 
of all subsets of X. A topological space is a set X together with r C 2^^ satisfying 
the following axioms:- 

(1) e T and X e T , 

(2) T is closed under arbitrary union, 

(3) T is closed under finite intersection. 

The set r is called a topology of X . 

Definition A. 7. Cover (Topology) . 

Let C = {?7q : a G A} is an indexed family of sets U^. 
C is a cover of set X if :- 

Definition A. 8. Basis (Topology). 

A set of sets B = {Ba} are said to be a basis of topology over X if:- 

(1) B covers ( definition \A.'T\ ) X . 

(2) Let -61,-82 £ B. Let x ^ Bi (1 B2. Then there is a basis element B^ such 
that X e B3 and B3 C Bif] B2. 

Definition A. 9. Dense Set. 

Let A be a subset of a topological space X . A is dense in X for any point x E X , 
if any neighborhood of x contains at least one point from A. 

The real numbers M with the usual topology have the rational numbers Q as a 
countable dense subset. 

Definition A. 10. Topological Transitivity (Mixing). 

A function f : X ^ X is topologically transitive (Mixing) if, given any every 
pair of non empty open sets A, B G X , there is some positive integer n such that 

/"(y4) nB^0. 

where /" means n'th iterate of f . 

Definition A. 11. Bounded Sequence. 

A sequence < Xn > is called a bounded sequence iff :- 

Vn I < Xn ^ u ] —00 < I < u < 00 

The number 'I' is called the lower bound of the sequence and 'u' is called the 
upper bound of the sequence. 

Lemma A.l. Bolzano- Weierstrass. 

Every bounded sequence has a convergent (Cauchy) subsequence. 
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It is to be noted that a bounded sequence may have many convergent subse- 
quences (for example, a sequence consisting of a counting of the rationals has 
subsequences converging to every real number) or rather few (for example a con- 
vergent sequence has all its subsequences having the same limit). 

Definition A. 12. Bounded Non-Cauchy Sequence in a metric space. 

A bounded sequence in a metric space is Non-Cauchy, iff the condition of defi- 
nition lX73\ is not met. 

Lemma A. 2. Existence of countable Cauchy subsequences in a Bounded 
Non-Cauchy Sequence. 

A Bounded non-cauchy sequence in a metric space will contain countable number 
of Cauchy subsequences. 

Proof of the Lemma lA.^ . We proof this by construction. Let A/" =< x„ > be a 
non Cauchy bounded sequence. 

By using lemma RTTI we must have one Cauchy subsequence on A^, and itself 
is not a Cauchy sequence (by assertion). We can isolate the cauchy subsequence 
and call it Ci. Now, if one removes the sequence Ci from J\f , we have a new 
sequence A/i. 

However, this A/i is also a bounded sequence, as any subsequence of a bounded 
sequence is also bounded. Then, repeating lemma lA.ll again, we must have one 
Cauchy subsequence on A/i , and we call it C2. That Ci 7^ C2 is certain, because, 
else both the sequences can be merged together. 

We can repeat this procedure, till step n when sequence Afn will not have any 
element left. By that time, we have a set of Cauchy Subsequences:- 

C = {Ci,C2,C3,.:,Cn} ] n>2, 
with the limit points (points of attraction) set:- 



A = {ai, a2, as, a„}; 

with :- 

ai = ttj =^ i = j. 

That is, no Ci,Cj have the same limit, unless i = j ,hj virtue of the construction. 

It is possible if lA/"] — > 00 that the n — 00 , but n remains a natural number, 
and hence, the set C remains countable. The construction of the set 'C proves the 
lemma. □ 

In informal language, any bounded Non-Cauchy sequence is a result of "mixing" 
countable Cauchy sequences, each of them having their own limit points. That 
means, in case there is a function f : X ^ X , whose orbit generates such 
bounded Non-Cauchy sequence, '/' essentially then acts as a "mixer" of multiple 
(but countable) Cauchy Sequences. 
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Appendix B. Definitions for Section S] 

Definition B.l. Turing Machine. 

A "Turing Machine" is a 7-tuple (Q^H^T ^d^qQ^qaiqr), where:- 

(1) Q is the set of states. 

(2) E is the set of input alphabets not containing the blank symbol /3. 

(3) r is the tape alphabet , where /3 G F and S C F. 
f^j^iQxF— j-QxFx {L, i?} is the transition function. 

(5) Qo & Q is start state. 

(6) Qa & Q is the accept state. 

(7) Qr & Q is the reject state. 

According to standard notion Qa ^ Qr , but we omit tliis requirement here, as 
we are not going to distinguish between two different types of halting ('accept and 
halt' vs 'reject and halt') of Turing Machines. 

A Turing Machine 'M' (definition IB.1|) computes as follows. 

Initially 'M' receives the input w = WiW2W3...Wn G S* on the leftmost 't?,' 
squares on the tape, and the rest of the tape is filled up with blank symbol '/3'. 
The head starts on the leftmost square on the tape. As the input alphabet 'S' 
does not contain the blank symbol '/?', the first '/3' marks end of input. 

Once 'M' starts, the computation proceeds wording to the rules of '5'. However, 
if the head is already at the leftmost position, then, even if the '5' rule says move 
'L' , the head stays there. 

The computation continues until the current state of the Turing Machine is 
either , or . In lieu of that, the machine will continue running forever. 

Definition B.2. Decider Turing Machine. 

A Turing Machine, which is guaranteed to halt on any input (i.e. reach one of 
the states {qa,qr} ) is called a decider. 

Definition B.3. Undecidable Problem. 

If for a given problem, it is impossible to construct a decider (definition \B.2\} 
Turing Machine, then the problem is called undecidable problem. 

Definition B.4. Universal Turing Machine. 

An 'UTM' or 'Universal Turing Machine' is a Turing Machine ( definition \B. 1\} 
such that it can simulate an arbitrary Turing machine on arbitrary input. 

Definition B.5. Church Turing Thesis. 

Every effective computation can be carried out by a Turing machine (definition 
\B.1\} . and hence by an Universal Turing Machine (definition B.4 ). 



Definition B.6. Halting problem. 

Given a description of a computer program, decide whether the program finishes 
running or continues to run forever. This is equivalent to the problem of deciding. 
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given a program and an input, whether the program will eventually halt when run 
with that input, or will run forever. 

Theorem B.l. Halting Problem is "undecidable" (Turing). 

There can not be any general decider (definition \B.^} . which can decide on the 
Halting Problem. In other words, the problem of definition \B.6i is "Undecidable" 
( definition \B.3\) . 

Appendix C. Definitions of Section \5\ 

Definition C.l. Sigma ( a ) Algebra. 

Let X be some set, with the notation 2^ is it's power set. Then the subset 
G 2^ is a Sigma (a) algebra ifj:- 

(1) S is non empty. 

(2) S is closed under complementation, if A G , then X \ ^4 C S. 

(3) S is closed under countable union, if Ai, A2, A^, ... C S , then 
AiU A2UA3,... C S. 

Definition C.2. Measure. 

Let Tj be a sigma algebra ( definition I C. 1\) over a set X . A function /i : S — R, 
where R = [—00, +00] is extended real line; is called a measure iff it satisfies the 
following properties:- 

(1) Non Negativity : 

fi{E) > ; V£; G S 

(2) Countable Additivity : For all countable collections {Ei}i^j of pairwise dis- 
joint set in S 




(3) Null Empty Set: 

fi{0) = 

Definition C.3. Measure Space. A measure space is a measurable space, 
that is a set considered together with the sigma-algehra ( definition I C. ij) on the set, 
possessing a non negative measure ( definition I C. ^ ). 

Definition C.4. Continuum Hypothesis. 

Let |Z| = and |M| = Ki. If S is a set, then, 

^S s.t. Ko < \S\ < Ki. 

Definition C.5. Generalized Continuum Hypothesis. 

Let A be a set with \ A\ = , with ordinal a and the power set of A is denoted 
by 2^ , then:- 
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and there is no set S such that Kq, < IS*! < Kq,+i. The ordinal of Z is 0, and 
hence that ofM. is 1, in line with the continuum hypothesis (def C.4 )- 

Definition C.6. Null Set. 

Let X be a measurable space, let fi be a measure on X , and let N be a measurable 
set in X. N is null set(or zero measure), if its measure ^{N) = . 

To be noted that, the set might not be empty set at alL In fact, the set Z 
is null, with respect to the measurable set M, while |Z| = Kg > that is, there are 
countably infinite number of elements in that set. 

Definition C.7. Probability Axioms (Kolmogorov axioms). 

Let ( Q, F, P ) be a measure space, with P{^1) = 1. Then (Q, F, P ) is a proba- 
bility space with sample space Q , event space F , and probability measure P. 

Definition C.8. Almost Surely or With Probability 1. 

Let ( Q, F, P ) be a probability space. One says that an event E E F happens 
almost surely if P{E) = 1. 

Definition C.9. Almost Never or With Probability 0. 

Let ( fl,F, P ) be a probability space. One says that an event E E F happens 
almost never if P{E) = 0. 

Let G F be an Almost Sure event, that is P{S) = 1 . Clearly, the complimen- 
tary event F \ 5" is an Almost Never event with P{F \ S) = . 
We now present theorems to showcase these concepts. 

Theorem C.l. Selection probability of an element from a null set is 0. 

Let N be a null set such that N G X , and X is measurable. If an element x is 
randomly selected from the set X , then:- 

P(x e N)=0. 

In other words P{x ^ N) = 1. 

Proof of theorem IC.il We note that P is a measure over X , and clearly if 
is a null set, then P{N) = by definition. But P{N) is the same as saying 
P(N) = P{x e N). Hence P{x G iV) = 0. But we know that:- 

P{X) = P{x ^N) + P{x G A^). 
Therefore, by rearranging:- 

P(x ^ N)= P{X) - P{x G A^). 
But then P{X) = 1 and P{x e N) = 0. Hence, P{x ^ N) = 1. □ 
The result of theorem IC.ll can be succinctly stated as:- 

"That a randomly drawn number from real line M is 'almost surely' irrational". 
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